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Conditional activation of Pik3ca^^^^^^ in a knock-in mouse 
model promotes mammary tumorigenesis and emergence 
of mutations 

W Yuan\ E Stawiski\ V Janakiraman\ E Chan^ S Durinck\ KA Edgar^ NM Kljavin\ CS Rivers\ F Gnad^ M Roose-Girma\ PM Haverty 
G Fedorowicz\ S Heldens\ RH Soriano\ Z Zhang^ JJ Walling L Johnson\ M Merchant^ Z l\/lodrusan\ HM Stern^ and S Seshagiri^ 

Oncogenic mutations in PIK3CA, which encodes the phosphoinositide-3-kinase (PI3K) catalytic subunit pIlOa, occur in ~25% 
of human breast cancers. In this study, we report the development of a knock-in mouse model for breast cancer where the 
endogenous PikSca allele was modified to allow tissue-specific conditional expression of a frequently found PikSca^^^"^^^ 
(PikSca^^^^^^"^^^) mutant allele. We found that activation of the latent PikSco^^^"^^^ allele resulted in breast tumors with multiple 
histological types. Whole-exome analysis of the P/7c3ca^^^'^^'^-d riven mammary tumors identified multiple mutations, including 
Trp53 mutations that appeared spontaneously during the development of adenocarinoma and spindle cell tumors. Further, we 
used this model to test the efficacy of GDC-0941, a PI3K inhibitor, in clinical development, and showed that the tumors 
respond to PI3K inhibition. 
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INTRODUCTION 

The class-IA phosphoinositide-3-kinase (PI3K) catalytic subunits, 
pi 10a, pilop and pi 105, occur as an obligate heterodimer in 
complex with p85a-, p55a-, p50a-, p85p- or p55y-regulatory 
subunits.^ In normal cells, PI3Ks are maintained in a basal 
inactive state and become active following growth factor 
stimulation and cell-surface-receptor engagement. Activated PI3Ks 
phosphorylate and convert phosphatidylinositol-4,5-bisphosphate 
(PIP2) to phosphatidylinositol-3,4,5-trisphosphate (PIP3). Elevated 
cellular PIP3 levels promote the activation of PI3K effectors, which 
in turn regulate a number of cellular processes, including cell 
growth, proliferation and survival.^ 

Frequent somatic mutations in PIKSCA, the gene that encodes 
pi 10a, have been reported in colorectal, breast and liver 
cancers."^"^ The majority of the reported pi 10a mutations occur 
in three hotspots-two (E542K and E545K) in the helical domain 
and one (H1047R) in the kinase domain.^'^ These hotspot 
mutations result in an oncogenic pi 10a capable of constitutively 
activating downstream signaling.^ Further, the pi 10a mutants 
when expressed in cells can lead to cellular transformation and 
support tumor formation in xenograft models."^'^"^^ 

Genetically engineered mouse models (GEMMs) of human 
cancer serve as an important resource for understanding tumor 
initiation and progression.^^ GEMMs have also been used to test 
therapeutics and develop treatment strategies.^ Studies aimed 
at understanding oncogenic transformation by mutant PikSca 
have primarily used cell-based systems, xenograft models 
and conditional transgenic models. Given the high 

frequency of PIKSCA mutations and ongoing efforts to develop 



small-molecule inhibitors, a knock-in mouse model that can 
express endogenous levels of mutant PikSca from its native 
promoter and genomic architecture would be a valuable tool to 
study tumor initiation, progression and treatment. Also, such a 
model would lend itself to identification of lesions that cooperate 
with mutant PikSca during tumor development. With this in mind, 
we generated a GEMM where we have modified the endogenous 
PikSca allele by placing a dormant copy of an oncogenic mutant 
PikSca exon 20, encoding H1047R, adjacent to the wild-type 
coding exon 20. Prior to activation of the dormant mutant allele, 
the engineered mice expressed the modified wild-type PikSca 
{PikSca^^^'^^^) allele. However, upon Cre-mediated recombination, 
the mutant PikSca^^^^^^"^^^ allele was conditionally activated in the 
mouse mammary glands, leading to its expression and tumor- 
igenesis. 

Although next-generation sequencing has enabled basepair- 
level characterization of human tumors as they evolve and 
progress,^ mouse models of cancer provide a defined 
experimentally tractable system that allows systematic sampling 
and characterization of tumors as they evolve. In an effort to 
characterize the tumors at the sequence level, we performed 
whole-exome capture and sequencing of tumors, and identified 
the emergence of spontaneous Jrp5S mutations as a potential 
cooperating event in spindle cell tumor and adenocarcinoma 
formation. We further report additional somatic mutations and 
copy-number aberrations in breast tumors from this model. In 
addition to molecular characterization, we tested the ability of a 
PI3K small-molecule inhibitor for efficacy and show that the 
tumors respond to inhibitor treatment. 
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RESULTS 

Engineering conditionally activatable PikSca^^^"^^^ mice 
To study the role of PikSca mutations in tumor initiation, 
development and progression, we have engineered a mouse 
capable of conditionally expressing the mutant PikSca H1047R 
allele driven by its native promoter. The engineered mouse, 
Pik3ca^^°'^'^\ has a wild-type PikSca exon 20 that was modified to 
contain flanking loxP sites. The modified wild-type exon is 
followed by a transcriptional stop cassette and a copy of PikSca 
exon 20 encoding an H1047R mutation (Figures 1a-d). A targeting 
vector (Figure lb) was used to modify the endogenous PikSca 
locus in mouse embryonic stem (ES) cells. Two independent 
ES cell clones containing the appropriate modification, identified 
by Southern blotting (Figures 1f and g), were used to generate 
chimeric mice that showed germline transmission of the 
PikSca^^^"^^^ allele. Intercrosses involving heterozygous 
PikSca^^^"^"^^^ ^ mice resulted in progenies with the appropriate 
genotypes at expected Mendelian ratios (Supplementary 
Table 1). Unlike the embryonic lethality observed in PikSca~^~- 
null mice,^^ prior to Cre-mediated recombination, we found that 
the homozygous pik3ca'^omwt/e20mwt apj^ais ^^^e born at the 

expected Mendelian frequency (Supplementary Table 1), indicat- 
ing that the modified PikSca^^'^'^^^ functioned analogous to the 
wild-type PikSca allele. 



Mammary gland-specific expression of PikSca^^^^^^"^^^ allele 
As PIKSCA is mutated in over 25% of human breast cancers^ we 
tested the role of PikSca'^ ^^"^^^ in breast tumorigenesis by breeding 
the PikSca^^^"""^^ mouse to an MMTV-Cre transgenic mouse.^^'^^ 
The MMTV-Cre strain expresses PI Cre recombinase under the 
control of a mammary gland-permissive MMTV-LTR promoter, 
allowing recombination and expression of PikSca^^^^^^'^'^^. We 
confirmed the expression of the PikSca^^^^^^"^^^ mutant allele and 
the wild-type PikSca allele by sequencing the cDNA corresponding 
to the PikSca mRNA extracted from the mammary glands of the 
PikSca^^^^^^'^^'^^^ mouse (Figure 1h). In contrast, RNA extracted 
from kidney showed the expression of only wild-type PikSca from 



the PikSca^^^"^^^ allele (Figure 1i), confirming mammary-specific 
expression of the mutant PikSca^^^^^^"^^^ allele. 

Expression of PikSca^^^^^^'^'^^ leads to enhanced mammary branch 
morphogenesis 

Previous studies have shown that expression of oncogenes or 
loss of tumor suppressors, such as Pten, affects branching 
and morphogenesis of mouse mammary glands.^ ^'^"^'^^ 
Hence, we studied the effect of expression of PikSca^^^^^^"^^^ on 
mammary gland development using whole-mount staining. At 
12 weeks, PikSca^^^'^^^'^^^^^ mutant mammary glands showed 
precocious lobulo-alveolar development and hyper-branched 
ductal structures compared to control mammary glands 
(Supplementary Figures la-d). We found that there was a 
~ 2-fold increase in ductal branch points in mutant animals 
{n = 6) as compared with the control group (MMTV-Cre {n = 3), 
P = 0.003; PikSca^^^""""^^^ (n = 4), P = 0.001; Supplementary Figure 
1e). By 50 weeks, the PikSca^^^^^^"^^^^^ mutant mammary glands 
showed a feathery, hyper-branched morphology compared with 
the control mammary glands (Supplementary Figures If and g). 
Further, histological sections of the PikSca^^^'^^^'^^^^^ mutant 
mammary glands showed evidence of tumor nodules at 50 weeks 
of age (Supplementary Figures 1h and i). This is similar 
to the mammary branching morphogenesis defects reported in 
previous studies involving Pten conditional null mice and 
other mammary-specific transgenic models with PI3K pathway 
activation.^ ^'^"^ 



PikSca^'^^^^^'^'^^ expression promotes mammary tumorigenesis 
Female PikSca'^^^'^""^^^^ , PikSca'^^"^""'^ ^ and MMTV-Cre mice 
were aged to see if they develop mammary tumors. We found that 
the PikSca^^^^^^"^^^^^ animals developed mammary gland tumors 
(Figures 2a -c), with a median tumor-free survival of 478.5 days. 
The majority of the animals developed tumors in the thoracic 
mammary glands (#1, 2, 3, 6 7 and 8), whereas some developed 
tumors in the inguino-abdominal mammary glands (#4, 5, 
9 and 10), and a few developed tumors in both thoracic 
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Figure 1. Generation of PikSco^^^^^^'^'^^ conditionally activatable knock-in allele, (a-d) Genomic locus encoding PikSca locus (a); targeting 
construct (b); targeted allele (c); targeted locus in the ES after removal of the neomycin cassette (d); and PikSca^^^'^^^'^^^ allele following Cre- 
mediate activation (e) are shown, (f, g) Southern blotting using a 5' probe (f ) and a 3' probe (g) was used to identify the appropriately targeted 
knock-in (ki) and wild-type (wt) allele, (h-i) Sanger sequencing of the cDNA obtained from PikSco^^^^^^^'^^^^ mammary gland (h) and kidney 
(i) following Cre-mediated activation confirms the expression of the PikSca^^^^^^'^'^'^ allele in the mammary gland. 
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Figure 2. PikSca nnice develop nnannnnary tunnors. (a, b) Mouse bearing tunnor in the abdonninal nnannnnary (a) and thoracic 

nnannnnary (b) glands. The white arrowheads indicate the location of the tunnors. (c) Kaplan -Meier plot depicting tunnor-free survival of two 
independent PlkSca^^^^^^'^'^^ lines following MMTV-Cre-nnediated activation of the latent Pik3ca^^^^^^ allele. Aninnals bearing nnannnnary tunnors 
(86.4% of line 1 and 84.2% of line 2 PikSca^^^^'^'^^^''^ fennale mice had developed tunnors) at the end point are shown in the plot. Two control 
groups, MMTV-Cre nnice and PikSca^^^^^^ , were tunnor-free for the duration of the study, (d-g) Histology of PikSca^^^^ '^'^^^^ nnannnnary 
tunnors. Analysis of H&E-stained tunnors sections show the presence of nnultiple histological types, which include fibroadenonna (d), ade- 



nosquannous carcinonna (e), adenocarcinonna (f) and spindle cell tunnor (g). (h) Western blot of PikScd 



,e20H1047R/^ 



nnannnnary tunnors re- 



presentative of the histological types shows elevated pAKT levels (lanes 3-10) and pS6 (lanes 3-10) in all tunnor types connpared with control 
nnannnnary glands (lanes 1 -2). Tunnors 9 and 10 were derived fronn passaging a prinnary spindle tunnor in the SCID nnice nnannnnary fat pad. 



and inguino-abdominal mammary glands. In contrast to 
P//c3cc7"^^^'^^^^/ + , the control P//c3cc7"^^^^^/+ and MMTV-Cre 
animals remained tumor-free for the duration of the study. The 
mammary gland tumor phenotype with a similar latency was also 
observed in PikSca^^^'^^^'^^'^^^ mice derived from a second 
independent targeted ES clone (line 2; Figure 2c), further 
confirming the specificity of the PikSca^^^'^^^'^^^ expression-driven 
tumor phenotype. 

MMTV-LTR promoter drives gene expression in both virgin and 
lactating mammary glands.^^ Given this, we compared tumor 
incidence in nulliparous and muciparous mice, and found that 
latency was significantly shortened from 492 to 465 days 
(P = 0.0002) in muciparous PikSca^^^^^^"^^^^^ mice as compared 
with nulliparous PikSca^^^'^^^'^^^^^ animals (Supplementary Figure 2), 
suggesting that pregnancy-related mammary gland changes 
potentially contribute to accelerated tumorigenesis. Alternatively, 
during pregnancy, an increase in progenitor cells undergoing 
Cre-mediated PikSca activation may contribute to the accelerated 
tumorigenesis observed.^^ 

Although the MMTV-LTR promoter directs Cre expression 
starting at day 6 after birth in mammary epithelial cells primarily, 
it is also known to drive expression in salivary glands and 
male seminal vesicles.^^ In performing histological analysis of 
salivary glands from mice bearing mammary tumors and 
seminal vesicles from males aged between 35 and 41 weeks, we 
found no evidence of tumors (Supplementary Figures 3a and b). 
However, morphologically the seminal vesicles were larger in 
PikSca^^^'^^^'^^^^^ mice as compared with the control animals. 
Histological analysis of seminal vesicle cross-sections indicated an 



increase in seminal vesicle fluid secretion leading to ductal 
distention and enlargement (Supplementary Figures 3c and d). 
However, this did not affect the breeding and fertility of 

P//c3ca^^^^'^"^^/+ males. 

Histology and signaling activation in PikSca^^^^^^'^^^-dmen tumors 
We performed histological analysis of PikSca^^^^^^"^^^^^ tumors to 
understand its pathological features and found that the majority 
of the tumors showed features consistent with fibroadenoma 
(76.9% (20/26)) (Figure 2d), whereas a few showed histopatholo- 
gical features of adenocarcinoma (15.4% (4/26)) (Figures 2e and f) 
or spindle cell neoplasia (7.7% (2/26)) (Figure 2g). 

To further characterize tumors, we tested the PI3K pathway 
activation status of the histologically distinct tumor types. In all 
tumors types, we observed elevated pAKT and pS6-kinase 
(Figure 2h), both downstream from PI3K, indicative of constitutive 
activation of the pathway in these tumors. 

In an effort to further understand the mammary tumor types 
observed, we studied the expression of basal cytokeratin 5 (CK5), 
luminal cytokeratin 18 (CK18), estrogen receptor (ER), progester- 
one receptor (PR) and a mesenchymal marker, vimentin, in 
them (Figure 3). While we found fibroadenomas and adenocarci- 
nomas to be CK5 + , CK18 + , ER+ and PR + , the spindle cell tumors 
were ER~/PR~ (Figure 3o) and CK5~ (data not shown). Interest- 
ingly, this is reminiscent of the ER~ and PR~ status reported for 
human mammary spindle cell tumors.^^'^^ In addition, the spindle 
cell tumors were CK1 8 ^/vimentin ^ (Figures 3n and p), indicating 
that the cells are undergoing epithelial -mesenchymal transition 
(EMT). 
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Figure 3. Innnnunofluorescence staining of Pik3ca^ nnannnnary tunnors. (a-p) Serial sections of a fibroadenonna (a-d), two adenocarci- 

nonnas (e-h and i-l) and a spindle cell tunnor (m-p) stained with H&E (a, e, i, m), anti-CK18 and anti-CK5 (b, f, j), anti-ERa (c, g, k), anti-ERa and 
anti-PR (o), anti-PR (d, h, I), and anti-CK18 and anti-vinnentin (n, p) antibodies. The section in panel p was obtained fronn a third-passage 
spindle cell tunnor explant. 



Genomic analysis of PikSca^ -driven mammary tumors 
Prior to performing genomic analysis of PikSca^^^"^^^ mammary 
tumors, given the long latency for tumor formation, we first 
tested whether tumor explants derived from mammary tumors 
would grow as xenografts in SCID mice. The majority of 
fibroadenomas, adenocarcinomas and spindle cell tumors that 
we tested formed tumors. Interestingly, the histology of some 
xenograft tumors derived from fibroadenoma explants showed 
features of adenocarcinoma, indicating that fibroadenomas are 
likely benign and that only a subset of cells with tumorigenic 
potential within the original explants contributed to the develop- 
ment of tumors upon passage (Supplementary Figures 4a and b). 
However, the spindle cell tumor explants maintained their 
histological features during passage (Supplementary Figures 4c 
and d). 

We performed microarray-based gene expression analyses of 
different mammary tumor subtypes and control mammary glands 
to further understand their molecular characteristics. Hierarchical 
clustering of expression data using markers for epithelial cells, 
stem cells and EIV1T^^'^° showed that primary fibroadenomas, 
adenocarcinomas and tumors derived from passaging the 
explants from these tumor types clustered together, indicating a 
common origin and a shared set of gene expression changes 
(Figure 4a). In contrast to the spindle cell type, both primary and 
passaged tumors, clustered together showing features of EMT, 



suggests a distinct cellular origin or is reflective of a distinct set of 
cooperating mutations in this tumor type (Figure 4a). 

Whole-exome capture and sequencing have been applied in 
identifying somatic mutations in the coding regions of genomes.^^ 
In order to understand the emergence of secondary mutations in 
the various mammary tumor types observed in our model, we 
performed whole-exome capture and sequencing of a select set 
of samples (Supplementary Table 2). Besides the presence of the 
engineered PikSca H1047R mutation, the tumors also contained 
several additional somatic changes (Supplementary Table 3 and 
Supplementary Figure 5). We found that the fibroadenomas had a 
low level of somatic mutations (between 2 and 13), followed by 
adenocarcinomas (between 4 and 61 mutations) and spindle cell 
tumors (between 44 and 88 mutations). Notably, we found the 
Jrp53 I192N mutation in an adenocarcinoma and an R245H 
mutation in a spindle cell tumor. The R245H mutation is 
equivalent of R248H TP53 hotspot mutation found in human 
cancers.^^ Additionally, to understand the prevalence of the Jrp53 
mutation in the oncogenic P/7c3cG-d riven mouse tumors, we 
sequenced the coding exons of Trp53 in an independent spindle 
cell tumor and identified an A135V mutation (Figure 4b, and 
Supplementary Tables 3 and 5). All Jrp53 mutations identified in 
this study occur in the DNA-binding domain and are known to 
impair Trp53 function and have been reported in human 
cancers.^^"^"^ Further, the Jrp53 mutation and the majority of 
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Figure 4. Expression profile and genonnic aberrations in P/7c3ca^^^^'°'^^'^-d riven nnannnnary tunnors. (a) Heatnnap derived by hierarchical clus- 
tering of the histologically distinct nnannnnary tunnor types using the expression level for the set of genes indicated. Mannnnary tunnors of 
different histological types show distinct expression profiles. Spindle cell tunnors are ClaudinAow and show elevated expression of EMT genes 
including Snail, Twist and Zebl/2. Sannples shown are spindle cell tunnors (1 -4; 2-4 are serially passaged tunnors derived fronn tunnor 1), control 
nnannnnary gland (Mg) (5-10), fibroadenonna (11-15) and adenocarinonna (16-20). Tunnor sannples 17 and 18 were derived by serial passage of 
tunnor 16. Sinnilarly, tunnor sannple 19 was derived fronn passaging tunnor 1 1. Sannples 5, 7 and 8 were nnannnnary glands fronn control groups. 
Sannples 6, 9 and 10 were nnatched nnannnnary glands fronn aninnals bearing tunnors 12, 13 and 14, respectively, (b) Spontaneous Trp53 
nnutations in adenocarcinonna and spindle cell tunnors. The genonnic region, the Trp53 exons, sequencing reads (black lines) fronn the region of 
interest with nnutation (red dots) are shown. The Sanger sequencing trace file depicting the A135V nnutation is also shown. The three Trp53 
nnutations identified nnapped on to its donnain architecture are shown. PR, proline-rich donnain; Reg, carboxy-ternninal regulatory donnain; TA, 
transactivation donnain; Tet, tetrannerization donnain; DBD, DNA-binding donnain. 



the nnutations identified in the prinnary spindle cell tunnor were 
also found in the explant-derived tumors following passage, 
indicating that the primary spindle tumor was likely genetically less 
heterogeneous (Supplementary Figure 5). As observed in the 
spindle cell tumors, the majority of the mutations present in the 
primary adenocarcinoma were also present in the tumors derived 
following transplantation and passaging (Supplementary Figure 5). 
Analysis of the effect of protein-altering somatic changes using 
SIFT^^ and/or PolyPhen^^ identified 39 mutations, including 
those in Plkl, Tssk2 and Smgl kinases, to have a functional 
effect suggesting that some of these may act as drivers 
(Supplementary Table 3 and Supplementary Figure 5). Interest- 
ingly, 5MG7 mutations have been reported previously in human 
breast cancer.^'' 

In addition to whole-exome mutational analysis, we profiled 
the tumors on CGH arrays to understand chromosomal copy- 
number alteration. We found that both fibroadenomas and 
adenocarcinomas showed few regions of copy-number aberra- 
tions (Supplementary Table 4). In contrast to fibroadenomas and 
adenocarcinomas, we found several regions with copy-number 
alterations in spindle cell tumor samples when compared with 
normal mammary glands (Supplementary Table 4 and Supple- 
mentary Figures 6s and 7a). Spindle cell tumors are also 
characterized by two regions of chromosomal loss that fall within 



chromosomes 11 and 12, both of which contain several coding 
genes. In particular, deletion in chromosome 1 1 resulted in loss 
of Nfl (Supplementary Figure 7) and is consistent with its 
downregulated expression (log2 ratio of —2.09; P = 8.338e— 05; 
Figure 4a), suggesting that the Ras-MAPK pathway might also be 
engaged in these tumors. 



PikSca^^^^^^"^^^^^ mammary tumors respond to PI3K inhibitor 
Engineered mouse models provide an ideal platform for candidate 
drug testing in a preclinical setting. As the mice in this study 
develop mammary tumors after a long latency, we used our 
explant-derived tumor model to test its utility in assessing drug 
efficacy. Explants derived from spindle cell tumors with Jrp53 
mutation were implanted subcutaneously near the mammary fat 
pad and treated with a PI3K inhibitor, GDC-0941,^^'^^ once the 
tumors had reached a volume of 200-300 mm^. Tumor-bearing 
mice were treated in four groups of nine animals each with 50, 
100 and 150mg/kg GDC-0941 or a vehicle. We found that animals 
treated with GDC-0941 showed the highest tumor growth 
inhibition (TGI) of 85% at 150mg/kg, although TGI was also 
observed at the other tested doses compared with vehicle 
treatment (Figure 5a), indicating that these tumors were still 
dependent on the mutant Pik3ca signaling for their growth. 
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Days Post-Treatment 

Figure 5. In vivo antitumor activity of PI3K inhibitor, (a) Pik3ca ^^^^^^"^^^ mammary tumor explants grown as xenografts respond to treatment 
with GDC-0941. 35, 41 and 85% on the graph represent %TGI with respect to control {n = 9) for each of the treatment dose {n = 9 for each 
dose), (b) Tumor from animals treated with GDC-0941 show evidence of PI3K pathway inhibition. 



Further, log-rank tests of these data regarding time to progression 
showed that all the drug treatment groups showed a statistically 
significant difference when compared with the vehicle-treated 
animals at all the doses tested (P< 0.0001 at 150mg/kg; 
Supplementary Figure 8). Consistent with the TGI observed, 
tumors from mice treated with GDC-0941 showed a significant 
decrease in pAKT and pS6 at least up to 10 h following treatment 
(Figure 5b), confirming the contribution of PI3K inhibition in tumor 
growth inhibition. 



DISCUSSION 

GEMMs of cancer are invaluable for understanding tumor 
initiation, progression and therapy evaluation.'^^ In addition, 
application of next-generation sequencing technologies to these 
models can provide valuable information on the molecular 
aberrations that arise during the evolution of tumors. Here we 
describe a conditionally activatable PikSca'^^^'^^^ mouse model of 
breast cancer and show that these mice develop mammary 
tumors of multiple histological types, including fibroadenomas, 
adenocarcinomas and spindle cell neoplasia. We found that the 
fibroadenomas and adenocarcinomas were positive for hormone 
receptors and cytokeratins (CK5 + /CK18 + ), indicating they are 
epithelial and myoepithelial in origin. By contrast, the spindle cell 
tumors, whereas being positive for cytokeratin 18, were distinct in 
that they were ER/PR-negative but vimentin-positive. This is 
reminiscent of human mammary spindle cell tumors.^^'^^ 

PIK3CA mutations are observed in multiple human breast cancer 
subtypes, including of hormone receptor-positive (HR^), Her2 + 
triple-negative (TN) and metaplastic types.^^ This is consistent with 
the multiple histological types observed in our breast cancer 
model and other published transgenic PikSca'^^^'^^^ breast tumor 
models.^^"^^ However, the latency of breast tumor development 
in our model is longer compared with these studies where the 
PikSca^^^'^'^^ mutant cDNA was expressed under the control of 
either an ROSA26 locus promoter (median survival ~140 days in 
line A; >500 days in line NLST)^^ or a chicken p-actin promoter 
(average latency 214 ±22.6 days)^^'^^'^^ following activation of its 
expression using MMTV-Cre or induction of expression from a 
tetracycline-inducible promoter (average latency 210 days).^^"^^ 
This likely reflects the fact that our knock-in model expresses the 
mutant PikSca allele from its endogenous promoter present in its 
native genomic configuration and hence more closely mimics the 
P//GC/A^'^'''''^-d riven tumorigenesis observed in human tumors. 

The advent of next-generation sequencing technology has 
allowed comprehensive characterization of cancers at the 
sequence level."^^ In applying next-generation sequencing tech- 
nology, we have characterized tumors derived from our mouse 



model to understand additional hits required for breast tumor 
development following expression of the PlkSca^^^"^^^ mutant 
allele. Among several candidate secondary hits that may 
cooperate with PikSca^ we report the occurrence of Jrp53 
mutations in two spindle cell tumors and one adenocarcinoma 
sample. All three Trp53 mutations, R245H, A135V and I192N 
(equivalent to human TP53 mutations R248H, A138V and I195N), 
observed in this study occur in human cancers.^^"^"^ JP53 and 
PIKSCA mutations co-occur in human breast cancer."^^ Recently, 
loss of Jrp53 has been shown to accelerate tumorigenesis in an 
oncogenic PikSca transgenic mouse breast tumor model. The 
spontaneous emergence of Jrp53 mutations in our tumor model 
mimics the cooperative JP53 lesions observed in human breast 
cancer. Hence, our mouse model will be ideal to study therapeutic 
intervention strategies for P/7c3ca-d riven mammary tumors. In line 
with this, we demonstrate the utility of our model in testing the 
efficacy of GDC-0941, a PI3K inhibitor, in a subset of mammary 
tumor with EMT features. A previous report studied the efficacy of 
NVP-BEZ235, a dual pan-PI3K/mT0R inhibitor, in a PikSca^^^"^^^ 
transgenic lung tumor model. However, given that PIKSCA 
mutations are more frequent in human breast cancers, our model 
can serve as a genetically defined system to test the efficacy of 
PI3K inhibitors in breast cancer and potentially help to develop 
treatment regimens for clinical drug testing. Also, the model can 
be used to understand the development of resistance to targeted 
therapeutics. In this context, it remains to be seen whether the 
c-Myc-mediated mechanism of resistance, recently reported in a 
transgenic oncogenic PikSca mammary tumor model, has a role in 
our model system. ^^"^^ 

Besides its utility in studying mammary tumorigenesis, this 
model can be used to study other cancers where PIKSCA mutations 
have been observed, by activating the mutant PikSca allele using 
an appropriate tissue-specific Cre line. In conjunction with next- 
generation sequencing technologies, our model can be used to 
discover and understand additional driver mutations that co- 
operate with mutant PikSca in other tumor types. Given that PI3K 
pathway activation has been implicated in resistance to therapy,"^^ 
the PikSca'^ ^^"^^^ mice can also be used to model drug resistance 
and study therapeutic strategies to overcome resistance. 

MATERIALS AND METHODS 

Generation of conditionally activatable PikSca^^^"^^^ mice 
We used a targeting vector template (Supplementary Figure 9) to assemble 
the final targeting vector. Using mouse genomic DNA from C57B/6N ES 
cells as template, a 2.045-kb fragment encompassing PikSco exons 18 
and 19 (left arm), a 588-bp region containing PikSca wild-type exon 20 
(mid-arm), and a 3.137-kb genomic DNA containing exon 20 and an 
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additional 3' intronic sequence (right arm) were PCR-amplified and used 
for constructing the vector. Prior to cloning into the template targeting 
vector, codon 1047 encoded by exon 20 in the left arm was mutated to 
code for histidine (R1047H) using standard mutagenesis techniques. In 
cloning the mid-arm into the targeting template vector, the wild-type exon 
20 region was modified by addition of a human growth hormone (hgh) 
polyA signal present in the vector. A neomycin (neo) expression cassette 
and a 4 X transcriptional stop cassette"^^ were part of the vector used to 
assemble the targeting construct. In the final configuration, the neomycin 
cassette was flanked by 'frt' sites to facilitate its excision following 
modification of the ES cells. Also, the modified wild-type exon was flanked 
by 'loxP' sites to facilitate its removal, upon which the mutant exon 20 can 
be activated (Figure 1). Accuracy of the targeting vector was verified by 
Sanger sequencing. The targeting construct was electroporated into C57B/ 
6N ES cells and selected for neomycin resistance. Appropriately targeted 
ES clones were identified by 5' and 3' Southern blotting. The 5' probe 
recognized a 6.6-kb wild-type fragment and a 9.1 -kb fragment in the 
targeted ES cell (Figures 1c and f). Similarly, the 3' probe recognized a 
6.9-kb wild-type fragment and a 4.8-kb fragment in the ES cells where the 
PikSca allele was modified (Figures 1c and g). Following removal of the neo 
cassette and confirmation of the architecture of the modified genomic 
region encoding PikSca by PCR and sequencing, the ES clones were 
injected into blastocysts to generate chimeric mice. A 290-bp PCR product, 
as opposed to a 201 -bp fragment in wild-type mice, generated using 
primers 20F (AGCCAGCAGACAATAATTCTTAGCACA) and 20R (CAGTGCTT 
GAACATTGGAGGTCAGT) was used to follow germline transmission and 
also genotype-modified, PikSca allele-bearing mice. To activate the latent 
PikSca^^^^'^^ mutant allele in mammary glands, we bred this mouse with an 
MMTV-Cre line (Jackson Labs, Bar Harbor, ME, USA; # 003551; line A). Mice 
that showed any body condition score < 2,^^ a hunched posture and 
> 20% loss of body weight were killed as per the Institutional Animal Care 
and Use Committee (lACUC) guidelines. Our endpoint study included 
mice with tumors that reached ~2500mm^ and also those with palpable 
tumors that were killed in accordance with lACUC recommendations. All 
mice were maintained in our animal facility as per the lACUC guidelines. 

Histological, immunohistochemical and whole-mount analysis 
Five-micrometer, formalin-fixed, paraffin-embedded specimens was used 
for routine hematoxylin and eosin (H&E) staining and histology evaluation. 
For whole-mount analysis, the mammary glands were dissected, spread 
and fixed with Carnoy's fixative for 2-4 h. Tissues were hydrated and 
stained in Carmine alum overnight. 

Immunofluorescence staining was performed using 10-|im sections 
Tissue-Tek OCT (Sakura Finetek, Torrance, CA, USA)-embedded frozen 
samples. The sectioned samples were fixed in 4% paraformaldehyde for 
lOmin and then blocked for 30min with PBT (PBS with 0.1% Triton) 
containing 1% bovine serum albumin. The blocked sections were then 
stained with an appropriate primary antibody diluted in PBT with 0.1% 
bovine serum albumin overnight at 4°C in a humidified chamber. The 
slides were washed three times in PBT and then incubated with 
appropriate secondary antibodies for 60min at room temperature in a 
humidified chamber. Unbound secondary antibody was removed by 
washing with PBT. The Prolong Gold anti-fade reagent (Molecular Probes, 
Eugene, OR, USA) was used to mount the slides. Primary antibodies against 
cytokeratin 5 (ab53121), cytokeratin 18 (ab668), PR (ab2764) and vimentin 
(ab45939) were obtained from Abeam (Cambridge, MA, USA). The ERa (Ab- 
21) was purchased from Thermo Fisher Scientific (Waltham, MA, USA). 
Appropriate secondary antibodies conjugated with Alexa 488 or 647 
(Molecular Probes) were used for detecting the bound primary antibody. 

Western blot analysis 

Frozen tumors were weighed and lysed with a pestle PP (Scienceware, 
Pequannock, NJ, USA) in cell extraction buffer (Invitrogen, Carlsbad, CA, 
USA) supplemented with protease inhibitors (Roche Applied Science, 
Indianapolis, IN, USA) 1 mM phenylmethysulfonyl fluoride, and phospha- 
tase inhibitor cocktails 1 and 2 (Sigma, St Louis, MO, USA). Protein 
concentrations were determined using the Pierce BCA Protein Assay kit 



(Rockford, IL, USA). For western blots, equal amounts of protein were 
separated by electrophoresis using NuPage Bis-Tris 4-12% gradient gels 
(Invitrogen) and then transferred onto nitrocellulose pore membranes 
using the iBIot system (Invitrogen). The primary antibodies, pAkt (Ser473), 
pAkt (Thr308), total Akt, pS6 (Ser235/236), pS6 (Ser240/242) and total S6, 
were obtained from Cell Signaling Technology (Danvers, MA, USA). Anti- 
actin antibody was purchased from Sigma-Aldrich. Chemiluminescence 
western blotting detection (Amersham Biosciences, Pittsburgh, PA, USA) 
coupled with appropriate secondary, horseradish peroxidase-conjugated 
secondary IgG antibodies was used to detect the primary, antibody-bound 
proteins. 

DNA/RNA preparation 

DNA from tumors and normal tissues was prepared using the QiagenAII- 
Prep DNA/RNA kit (Qiagen, Valencia, CA, USA). RNA samples were prepared 
using the RNeasy Mini kit (Qiagen). 

Expression analysis 

The quantity and quality of total RNA samples were determined, 
respectively, using an ND-1000 spectrophotometer (Thermo Scientific, 
Wilmington, DE, USA) and Bioanalyzer 2100 (Agilent Technologies, Santa 
Clara, CA, USA). A Cy dye-labeled cRNA preparation and array hybridization 
were prepared as per the manufacturer's (Agilent Technologies) instruc- 
tions. Briefly, total RNA sample was converted to double-stranded cDNA 
and then to Cy dye-labeled cRNA using an Agilent Quick Amp Labeling kit. 
The labeled cRNA was purified using an RNeasy mini kit (Qiagen). cRNA 
yield and Cy dye incorporation were determined using the ND-1000 
spectrophotometer (Thermo Scientific). A 750-ng weight of the labeled 
cRNA was fragmented and hybridized to Agilent Whole Mouse Genome 
4x44K arrays. All samples were labeled with Cy5 dye and hybridized 
against a Cy3 dye-labeled universal mouse reference (Stratagene, La Jolla, 
CA, USA). Samples were hybridized for 1 7 h at 65 °C with constant agitation 
on a Pro Hybridization station HS 4800 (Tecan US, Durham, NC, USA). 
Following hybridization, the arrays were washed, dried and scanned using 
an Agilent scanner. The Agilent Feature Extraction software 10.7 was used 
to analyze the acquired array images. The data were processed using 
methods implemented in the limma R package.^^ First, data were 
background-corrected, after which a within-array loess fit normalization 
and a between-array quintile normalization were applied.^^ Differentially 
expressed gene signatures were obtained using limma's linear model fit 
and probes with a false discovery rate-adjusted P-value of 0.05 or lower, 
and a log2 fold change of at least 2, were deemed differentially expressed. 
Hierarchical clustering of samples and EMT-related genes was computed 
on the mean centered gene expression profiles using the Euclidian 
distance metric and complete linkage clustering. 

CGH arrays 

Mammary tumor samples from PikSca^^^^^^^'^'^^^ mice were profiled on 
Agilent CGH arrays according to the manufacturer's protocol (Agilent 
Technologies). Briefly, 500 ng DNA of both tumor and a reference sample 
(C57BL/6J mouse; The Jackson Laboratory, ME, USA) were digested with 
Alu\ and Rsa\ (Promega, Madison, Wl, USA) and subsequently purified using 
the QIAprep Spin Miniprep kit (Qiagen). Digested samples were labeled 
with Cy5-dUTP (test sample) or Cy3-UTP (reference sample) using the 
Genomic DNA Labeling Kit Plus (Agilent Technologies). Test samples were 
pooled with the reference and subsequently purified using MicroCon 
YM-30 (Millipore, Billerica, MA, USA). Labeled probes were mixed with 
Cot-1 DNA (Invitrogen), 10 x blocking agent and 2 x Hi-RPM hybridization 
buffer (Agilent Technologies), and hybridized to an Agilent 244K Mouse 
Genome CGH microarray. The samples were hybridized for 24 h at 67 °C 
with constant agitation using a Pro Hybridization station HS 4800 (Tecan 
US). The arrays were washed, dried and scanned using the Agilent scanner 
according to the manufacturer's protocol (Agilent Technologies). Individual 
log2 ratios of background-subtracted signal intensities were obtained from 
the Agilent Feature Extraction software version 10.7. The log2 ratios were 
centered to a median of zero and the resulting log2 ratio values for each 
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probe were segmented using GLAD.^^ All genes within the genomic 
bounds of a given GLAD-derived segment were assigned the mean copy- 
number value of the probes within that segment. Copy-number values 
^0.3 log2 ratio units represented gain and values < = -0.3 log2 ratio units 
represented loss. 

Exome capture and sequencing 

Targeted sequence capture was performed using 3 |ig of genomic DNA and 
the SureSelectXT Mouse All Exon kit according to the manufacturer's 
protocol (Agilent Technologies). The Mouse All Exon kit is designed to 
capture 49.6 Mb of a targeted region and covers 221 784 exons within 
24 306 genes. The pre-capture library was amplified using four PGR cycles, 
whereas 12 PGR cycles were used in post-capture amplification. Fragment 
size distribution of post-capture amplified libraries was determined using 
Bioanalyzer 2100 using a DNA high-sensitivity chip (Agilent Technologies). 
Concentration of the libraries was measured by the Kapa library 
quantification kit (Kapa Biosystems, Woburn, MA, USA). Each library was 
sequenced on HiSeq 2000 to generate 2 x 75 bp reads per the 
manufacturer's instructions (lllumina, San Diego, GA, USA). The sequencing 
reads were mapped to the UGSG mouse genome (NGBI37/mm9) using the 
BWA software^° set to default parameters. Local realignment, duplicate 
marking and raw variant calling were performed as described previously .^^ 
Single-nucleotide polymorphisms represented in dbSNP Build 131^^ were 
used to remove known germline variations. In addition, variants that 
were present in both tumor and normal samples were removed as germline 
variations. The remaining variations present in the tumor sample, but 
absent in the matched normal, were predicted to be somatic. The predicted 
somatic variations were additionally filtered to include only positions with a 
minimum of 1 0 x coverage in both the tumor and matched normal, as well 
as an observed variant allele frequency of < 1 % in the matched normal. The 
effect of protein somatic mutation on gene function was predicted using 
SIFT^^ and PolyPhen^^ (Supplementary Table 3). 

Sanger sequencing 

Spindle cell tumor from animal #113 (Supplementary Table 2) was analyzed 
for Trp53 mutations using DNA derived from formalin-fixed, paraffin- 
embedded material. We used a pair of nested primers (outer and inner) to 
PGR-amplify the Trp53 exons (Supplementary Table 5). The inner primers were 
m13-tagged to facilitate sequencing. The primers used for amplification are 
provided in Supplementary Table 5. PGR was performed using Platinum Taq 
kit (Invitrogen) as per the manufacturer's protocols. The following parameters 
were used for PGR amplification: 94 °G for 2min, followed 30 cycles of 
denaturation at 94 °G for 30 s, annealing at 56 °G for 30 s and extension at 
72 °G for 1 min. Samples, prior to analysis, were held at 4°G after a final 
extension at 72 °G for 8 min. PGR products were purified using the ExoSAP-IT 
kit (Affymetrix, Santa Glara, GA, USA) and sequenced using ABI3730xl per the 
manufacturer's instructions (Life Technologies, Foster Gity, GA, USA). 
Sequencing reads were analyzed using Sequencher (GeneGodes, Ann Arbor, 
Ml, USA). 

Drug efficacy study 

Mammary tumor pieces were implanted subcutaneously near the 
mammary fat pad of 8- to 10-week-old female G.B-17 SGID beige mice 
weighing between 20 and 25 g (Gharles River Laboratories, Wilmington, 
MA, USA). Tumors were monitored until they reached a mean tumor 
volume of 200-300 mm^ and animals were randomly distributed into four 
groups of nine animals each before initiation of dosing. Mice were orally 
gavaged daily for 1 3 days with 50, 1 00 and 1 50 mg/kg GDG-0941 dissolved 
in 0.5% methylcellulose with 0.2% Tween 80 (MGT). Tumor sizes and 
mouse body weights were recorded twice weekly over the course of the 
study. Tumor volumes were measured in two dimensions using a caliper 
and calculated using the following formula: Tumor size (mm^) = (Longer 
measurement x Shorter measurement^) x 0.5. Percent weight change was 
calculated using the following formula: Group percent weight chan- 
ge = ((New weight-Initial weight)/lnitial weight) x 100. Mean tumor 
volumes (±s.e.m.) and percent weight change were calculated and 



plotted using Kaleidagraph (version 4.03; Synergy Software, Reading, PA, 
USA). Mice with tumor volumes ^ 2000 mm^ or with losses in body weight 
of ^20% from initial body weight were promptly killed based on the 
lAGUG guideline. To analyze the repeated measurement of tumor volumes 
from the same animals over time, a mixed modeling approach was used.^^ 
This approach addresses both repeated measurements and modest 
dropouts owing to any non-treatment-related death of animals before 
study end. Gubic regression splines were used to fit a non-linear profile to 
the time courses of log2 tumor volume at each dose level. These non-linear 
profiles were then related to dose within the mixed model. Tumor growth 
inhibition as a percentage of vehicle (%TGI) was calculated as the 
percentage of the area under the fitted curve (AUG) for the respective dose 
group per day in relation to the vehicle, using the formula: 
%TGI = 100 X (1 -AUGoose/AUGveh). To get uncertainty intervals for %TGI, 
the fitted curve and the fitted covariance matrix were used to generate a 
random sample as an approximation to the distribution of %TGI. The 
random sample is composed of 1000 simulated realizations of the fitted- 
mixed model, where %TGI has been re-calculated for each realization. Our 
reported uncertainty interval is values for which 95% of the time the re- 
calculated values of %TGI will fall in this region given the fitted model. The 
2.5 and 97.5 percentiles of the simulated distribution were used as the 
upper and lower uncertainty intervals. Plots were generated using R 
(version 2.8.1; R Development Gore Team 2008, R Foundation for Statistical 
Computing, Vienna, Austria) and Excel. Data were analyzed using R and the 
mixed models were fitted within R using the nime package.^^ 
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